Speaker Recognition Using Neural Tree Networks

نویسندگان

Kevin R. Farrell

Richard J. Mammone

چکیده

A new classifier is presented for text-independent speaker recognition. The new classifier is called the modified neural tree network (MNTN). The NTN is a hierarchical classifier that combines the properties of decision trees and feed-forward neural networks. The MNTN differs from the standard NTN in that a new learning rule based on discriminant learning is used, which minimizes the classification error as opposed to a norm of the approximation error. The MNTN also uses leaf probability measures in addition to the class labels. The MNTN is evaluated for several speaker identification experiments and is compared to multilayer perceptrons (MLPs) , decision trees, and vector quantization (VQ) classifiers. The VQ classifier and MNTN demonstrate comparable performance and perform significantly better than the other classifiers for this task. Additionally, the MNTN provides a logarithmic saving in retrieval time over that of the VQ classifier. The MNTN and VQ classifiers are also compared for several speaker verification experiments where the MNTN is found to outperform the VQ classifier.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

شبکه عصبی پیچشی با پنجره‌های قابل تطبیق برای بازشناسی گفتار

Although, speech recognition systems are widely used and their accuracies are continuously increased, there is a considerable performance gap between their accuracies and human recognition ability. This is partially due to high speaker variations in speech signal. Deep neural networks are among the best tools for acoustic modeling. Recently, using hybrid deep neural network and hidden Markov mo...

متن کامل

Mlp Mlp Rbf Rbf Gn Gn Cart Ntn

Speaker independent vowel recognition is a di cult pattern recognition problem. Recently therehas been much research using Multi-Layer Perceptrons (MLP) and Decision Trees for this task. Thispaper presents a new approach to this problem. A new neural architecture and learning algorithmcalled Neural Tree Networks (NTN) are developed. This network uses a tree structure with a neural<l...

متن کامل

Using neural network to estimate weibull parameters

As is well known, estimating parameters of the tree-parameter weibull distribution is a complicated task and sometimes contentious area with several methods vying for recognition. Weibull distribution involves in reliability studies frequently and has many applications in engineering. However estimating the parameters of Weibull distribution is crucial in classical ways. This distribution has t...

متن کامل

A Comparative Study of Gender and Age Classification in Speech Signals

Accurate gender classification is useful in speech and speaker recognition as well as speech emotion classification, because a better performance has been reported when separate acoustic models are employed for males and females. Gender classification is also apparent in face recognition, video summarization, human-robot interaction, etc. Although gender classification is rather mature in a...

متن کامل

Isolated Voiced Digit Recognition Using Inductive Inference

This paper proposes the use of inductive inference "decision trees" for isolated digit recognition. The aim of this research is to demonstrate that inductive learning can provide an alternative approach to existing automatic speech recognition techniques such as Dynamic Time Warping (DP), Hidden Markov Modelling (HMM) and Neural Networks (NN). The construction of the decision tree is based on C...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 1993

Speaker Recognition Using Neural Tree Networks

نویسندگان

چکیده

منابع مشابه

شبکه عصبی پیچشی با پنجره‌های قابل تطبیق برای بازشناسی گفتار

Mlp Mlp Rbf Rbf Gn Gn Cart Ntn

Using neural network to estimate weibull parameters

A Comparative Study of Gender and Age Classification in Speech Signals

Isolated Voiced Digit Recognition Using Inductive Inference

عنوان ژورنال:

اشتراک گذاری